Polarity Inducing Latent Semantic Analysis
نویسندگان
چکیده
Existing vector space models typically map synonyms and antonyms to similar word vectors, and thus fail to represent antonymy. We introduce a new vector space representation where antonyms lie on opposite sides of a sphere: in the word vector space, synonyms have cosine similarities close to one, while antonyms are close to minus one. We derive this representation with the aid of a thesaurus and latent semantic analysis (LSA). Each entry in the thesaurus – a word sense along with its synonyms and antonyms – is treated as a “document,” and the resulting document collection is subjected to LSA. The key contribution of this work is to show how to assign signs to the entries in the co-occurrence matrix on which LSA operates, so as to induce a subspace with the desired property. We evaluate this procedure with the Graduate Record Examination questions of (Mohammed et al., 2008) and find that the method improves on the results of that study. Further improvements result from refining the subspace representation with discriminative training, and augmenting the training data with general newspaper text. Altogether, we improve on the best previous results by 11 points absolute in F measure.
منابع مشابه
Query expansion based on relevance feedback and latent semantic analysis
Web search engines are one of the most popular tools on the Internet which are widely-used by expert and novice users. Constructing an adequate query which represents the best specification of users’ information need to the search engine is an important concern of web users. Query expansion is a way to reduce this concern and increase user satisfaction. In this paper, a new method of query expa...
متن کاملUO_UA: Using Latent Semantic Analysis to Build a Domain-Dependent Sentiment Resource
In this paper we present our contribution to SemEval-2014 Task 4: Aspect Based Sentiment Analysis (Pontiki et al., 2014), Subtask 2: Aspect Term Polarity for Laptop domain. The most outstanding feature in this contribution is the automatic building of a domain-depended sentiment resource using Latent Semantic Analysis. We induce, for each term, two real scores that indicate its use in positive ...
متن کاملA multilingual semi-supervised approach in deriving Singlish sentic patterns for polarity detection
Due to the huge volume and linguistic variation of data shared online, accurate detection of the sentiment of a message (polarity detection) can no longer rely on human assessors or through simple lexicon keyword matching. This paper presents a semi-supervised approach in constructing essential toolkits for analysing the polarity of a localised scarce-resource language, Singlish (Singaporean En...
متن کاملAutomatic Software Clustering via Latent Semantic Analysis
1 This paper appears in the 14 IEEE ASE’99, Cocoa Beach FL, Oct. 12-15, pp. 251-254 Abstract The paper describes the initial results of applying Latent Semantic Analysis (LSA) to program source code and associated documentation. Latent Semantic Analysis is a corpus-based statistical method for inducing and representing aspects of the meanings of words and passages (of natural language) reflecti...
متن کاملSentiment Analysis in Student Experiences of Learning
In this paper we present an evaluation of new techniques for automatically detecting sentiment polarity (Positive or Negative) in the students responses to Unit of Study Evaluations (USE). The study compares categorical model and dimensional model making use of five emotion categories: Anger, Fear, Joy, Sadness, and Surprise. Joy and Surprise are taken as a Positive polarity, whereas Anger, Fea...
متن کامل